A visualization tool for interactive learning of large decision trees
نویسندگان
چکیده
Decision tree induction is certainly among the most applicable learning techniques due to its power and simplicity. Howevel; learning decision trees from large datasets, particularly in data mining, is quite different from learning from small or moderately sized datasets. When learning from large datasets, decision tree induction programs often produce very large trees. How to visualize efficiently trees in the learning process, particularly large trees, is still questionable and currently requires efficient tools. This paper presents an visualization tool for interactive learning of large decision trees, that includes a new visualization technique called l 2 S D (stands for Trees 2.5 Dimensions). After a brief discussion on requirements for tree visualizers and related work, the paper focuses on presenting developing techniques for the issues ( I ) how to visualize efficiently large decision trees; and (2 ) how to visualize decision trees in the learning process.
منابع مشابه
Interactive Exploration of Decision Tree Results
Our investigation aims at interactively exploring the decision tree results obtained by the machine-learning algorithms like C4.5. We propose an interactive graphical environment using the new radial tree layout, zoom/pan techniques and some existing visualization methods like explorer-like, hierarchical visualization, interactive techniques to represent large decision trees in a graphical mode...
متن کاملAn Interactive Machine Learning Framework
Machine learning (ML) is believed to be an effective and efficient tool to build reliable prediction model or extract useful structure from an avalanche of data. However, ML is also criticized by its difficulty in interpretation and complicated parameter tuning. In contrast, visualization is able to well organize and visually encode the entangled information in data and guild audiences to simpl...
متن کاملUntangling webs of interactions in classification models Popular Science Summary
Rule-based classifiers have one major advantage over other classes of supervised learning algorithms: interpretability. They provide a means to read into a model and find how the features co-act in order to come to a classification outcome. This in turn enables the researcher to visualize the feature interactions and evaluate the key features that discern between different decision classes. The...
متن کاملSim•TwentyFive: An Interactive Visualization System for PICU Decision Support
Pediatric intensive care unit (PICU) physicians need better decision support tools for finding and exploring similar patients to a given patient-of-interest. Recently, unsupervised machine learning methods have been shown to be effective at generating similarity scores between patients based on temporal physiological data. In this paper, we present Sim•TwentyFive, an interactive visualization d...
متن کاملUNDERSTANDING WHAT MACHINE LEARNING PRODUCES Part II: Knowledge visualization techniques
Researchers in machine learning use decision trees, production rules, and decision graphs for visualizing classification data. Part I of this paper surveyed these representations, paying particular attention to their comprehensibility for non-specialist users. Part II turns attention to knowledge visualization—the graphic form in which a structure is portrayed and its strong influence on compre...
متن کامل